Detecting word endings in an unknown script
Similar resources
Decoding Anagrammed Texts Written in an Unknown Language and Script
Algorithmic decipherment is a prime example of a truly unsupervised problem. The first step in the decipherment process is the identification of the encrypted language. We propose three methods for determining the source language of a document enciphered with a monoalphabetic substitution cipher. The best method achieves 97% accuracy on 380 languages. We then present an approach to decoding ana...
Script Independent Word Spotting in Multilingual Documents
This paper describes a method for script independent word spotting in multilingual handwritten and machine printed documents. The system accepts a query in the form of text from the user and returns a ranked list of word images from document image corpus based on similarity with the query word. The system is divided into two main components. The first component known as Indexer, performs indexi...
Forehearing words: Pre-activation of word endings at word onset
Occurring at rates up to 6-7 syllables per second, speech perception and understanding involves rapid identification of speech sounds and pre-activation of morphemes and words. Using event-related potentials (ERPs) and functional magnetic resonance imaging (fMRI), we investigated the time-course and neural sources of pre-activation of word endings as participants heard the beginning of unfoldin...
Preprocessing techniques for cursive script word recognition
This paper deals with techniques for improving the recognition rate of a cursive script word recognition system. Closed-loop preprocessing techniques have been designed and implemented to achieve this objective on a limited vocabulary but with no restrictions on handwriting style. This paper discusses the details of such a system and its performance on samples from several authors. Results obt...
Morphological Reconstruction for Word Level Script Identification
A line of a bilingual document page may contain text words in regional language and numerals in English. For Optical Character Recognition (OCR) of such a document page, it is necessary to identify different script forms before running an individual OCR system. In this paper, we have identified a tool of morphological opening by reconstruction of an image in different directions and regional de...
Journal
Journal title: BAF-Online: Proceedings of the Berner Altorientalisches Forum
Year: 2018
ISSN: 2504-2076
DOI: 10.22012/baf.2017.11